Combining image, voice, and the patient's questionnaire data to categorize laryngeal disorders
نویسندگان
چکیده
OBJECTIVE This paper is concerned with soft computing techniques for categorizing laryngeal disorders based on information extracted from an image of patient's vocal folds, a voice signal, and questionnaire data. METHODS Multiple feature sets are exploited to characterize images and voice signals. To characterize colour, texture, and geometry of biological structures seen in colour images of vocal folds, eight feature sets are used. Twelve feature sets are used to obtain a comprehensive characterization of a voice signal (the sustained phonation of the vowel sound /a/). Answers to 14 questions constitute the questionnaire feature set. A committee of support vector machines is designed for categorizing the image, voice, and query data represented by the multiple feature sets into the healthy, nodular and diffuse classes. Five alternatives to aggregate separate SVMs into a committee are explored. Feature selection and classifier design are combined into the same learning process based on genetic search. RESULTS Data of all the three modalities were available from 240 patients. Among those, 151 patients belong to the nodular class, 64 to the diffuse class and 25 to the healthy class. When using a single feature set to characterize each modality, the test set data classification accuracy of 75.0%, 72.1%, and 85.0% was obtained for the image, voice and questionnaire data, respectively. The use of multiple feature sets allowed to increase the accuracy to 89.5% and 87.7% for the image and voice data, respectively. The test set data classification accuracy of over 98.0% was obtained from a committee exploiting multiple feature sets from all the three modalities. The highest classification accuracy was achieved when using the SVM-based aggregation with hyper parameters of the SVM determined by genetic search. Bearing in mind the difficulty of the task, the obtained classification accuracy is rather encouraging. CONCLUSIONS Combination of both multiple feature sets characterizing a single modality and the three modalities allowed to substantially improve the classification accuracy if compared to the highest accuracy obtained from a single feature set and a single modality. In spite of the unbalanced data sets used, the error rates obtained for the three classes were rather similar.
منابع مشابه
Comparing the Voice Handicap Index Scores in Groups with Structural and Functional Voice Disorders
Objective: The effects of voice disorders vary from person to person. Occupation, work environment, life, and family reaction are variables that affect one’s perception of his/her own as an impaired voice. Voice Handicap Index (VHI) has not yet been used to compare the degree of voice disorders. Assuming that the quality of life may be different under a variety of voice disorders and that diffe...
متن کاملVoice Recovery in a Patient with Inhaled Laryngeal Burns
Introduction: Laryngeal burns cause long-term voice disorders due to mucosal changes of the vocal folds. Inhalation injuries affect voice production and result in changes in the mucosal thickness and voice quality. Case Report: A 47-year-old woman was transferred to our department with laryngeal burns sustained during a house fire. On laryngoscopic examination, mucosal waves of both vocal fol...
متن کاملThe Study of Vocal Function in Patients With Early Laryngeal Carcinoma After Transoral Laser Microsurgery
Objective Today transoral laser microsurgery is considered as one of the first options to control early laryngeal cancer, and voice disorder is one of the inevitable complications of this therapeutic component. This study aimed to compare the vocal function in patients with early-stage laryngeal cancer following laser surgery with healthy individuals with normal voice quality using acoustic ana...
متن کاملEtiologies of Dysphonia in Patients Referred to ENT Clinics Based on videolaryngoscopy
Introduction: Laryngeal dysfunction may be divided into three categories; organic, neurologic and functional disorders. Dysphonia and hoarseness are the most common symptoms and, in some cases, the only signs of laryngeal dysfunction. In differential diagnosis of any type of chronic hoarseness, a neoplastic process must be considered and, thus continuous light video laryngoscopy can provide imp...
متن کاملImproving Voice Outcomes After Injury to the Recurrent Laryngeal Nerve
Objectives: The present study aimed to determine the voice outcomes before and after the administration of voice therapy in patients who suffered an injury to the recurrent laryngeal nerve after undergoing thyroidectomy. Methods: The sample consisted of 26 patients (2 males and 24 females) aged between 18 and 80 years (m=55±12) who experienced injury to the recurrent laryngeal nerve fol...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Artificial intelligence in medicine
دوره 49 1 شماره
صفحات -
تاریخ انتشار 2010